Improving Polish Mention Detection with Valency Dictionary
نویسنده
چکیده
This paper presents results of an experiment integrating information from valency dictionary of Polish into a mention detection system. Two types of information is acquired: positions of syntactic schemata for nominal and verbal constructs and secondary prepositions present in schemata. The syntactic schemata are used to prevent (for verbal realizations) or encourage (for nominal groups) constructing mentions from phrases filling multiple schema positions, the secondary prepositions – to filter out artificial mentions created from their nominal components. Mention detection is evaluated against the manual annotation of the Polish Coreference Corpus in two settings: taking into account only mention heads or exact borders.
منابع مشابه
Resources for Extending the PolNet - Polish WordNet with a Verbal Component
The paper presents the initial, basic step in extension of PolNet (Polish WordNet) with verbs. This step consists in formatting the source data necessary for final computerassisted creation of verbal synsets including valency information. An algorithm for compiling verb descriptions contained in two human-oriented dictionaries into a computer tractable electronic resource is presented.
متن کاملVerbal Valency in the MT Between Related Languages
The paper analyzes the differences in verbal valency frames between two related Slavic languages, Czech and Russian, with regard to their role in a machine translation system. The valency differences are a frequent source of translation errors. The results presented in the paper show that the number of substantially different valency frames is relatively low and that a bilingual valency diction...
متن کاملA Method of Adding New Entries to a Valency Dictionary by Exploiting Existing Lexical Resources
Information on subcategorization and selectional restrictions in a valency dictionary is very important for natural language processing in tasks such as monolingual parsing, accurate rule-based machine translation and automatic summarization. However, adding this detailed information is both time consuming and costly. In this paper we present a method of assigning valency information and select...
متن کاملExtending The Coverage Of A Valency Dictionary
Information on subcategorization and selectional restrictions is very important for natural language processing in tasks such as monolingual parsing, accurate rule-based machine translation and automatic summarization. However, adding this detailed information to a valency dictionary is both time consuming and costly. In this paper we present a method of assigning valency information and select...
متن کاملEvaluation of a Method of Creating New Valency Entries
Information on subcategorization and selectional restrictions is important for natural language processing tasks such as deep parsing, rule-based machine translation and automatic summarization. In this paper we present a method of adding detailed entries to a bilingual dictionary, based on information in an existing valency dictionary. The method is based on two assumptions: words with similar...
متن کامل